THE MIXING TIME OF THE NEWMAN WATTS SMALL WORLD 



Abstract. "Small worlds" are large systems in which any given node has only a few con- 
nections to other points, but possessing the property that all pairs of points are connected 
by a short path, typically logarithmic in the number of nodes. The use of random walks 
for sampling a uniform element from a large state space is by now a classical technique; to 
prove that such a technique works for a given network, a bound on the mixing time is re- 
quired. However, little detailed information is known about the behaviour of random walks 
on small-world networks, though many predictions can be found in the physics literature. The 
principal contribution of this paper is to show that for a famous small-world random graph 
model known as the Newman- Watts small world, the mixing time is of order log 2 n. This 
confirms a prediction of Richard Durrett, who proved a lower bound of order log 2 n and an 
upper bound of order log 3 n. 



The small-world phenomenon is a catchy name for an important physical phenomenon that 
shows up throughout the physical, biological and social sciences. In brief, the term applies 
to large, locally sparse systems (usually, possessing only a bounded number of connections 
from any given point) which nonetheless exhibit good long-range connectivity in the sense 
that there are short paths between all points in the system. The Erdos-Renyi random graph 
G np , is perhaps the most mathematically famous model possessing small- world behaviour: 
when p = c/n for c > 1 fixed, the average vertex degree is c, and the diameter of the largest 
connected component is [18] 



where c* < 1 satisfies ce c = c*e c and O p (l) denotes a random amount that remains bounded 
in probability as n — > oo. 

The Erdos-Renyi random graph is unsatisfactory as a small-world model in two ways: first, 
the network does not satisfy full connectivity (a constant proportion of vertices lie outside of the 
giant component); second, the graph is locally tree-like - for any fixed k, the probability that 
there is a cycle of length at most k through a randomly chosen node node is o(l). In real- world 
networks showing small world behaviour (social or business networks, gene regulatory networks, 
networks for modelling infectious disease spread, scientific collaboration networks, and many 
others - the book [16] contains many interesting examples), full or almost-full connectivity is 
standard, and short cycles are plentiful. Several connected models have been proposed which in 
some respects capture the desired local structure as well as small-world behaviour, notably the 
Bollobas-Chung [1], Watts-Strogatz [20], and Newman- Watts [14, 15] models. These models 
are closely related - all are based on adding sparse, long range connections to a connected 
"base network" which is essentially a cycle. 
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Understanding the behaviour of random walks on small-world networks remains, in general, 
a challenging open problem. Numerical and non-rigorous results for return probabilities [10], 
relaxation times [17], spectral properties [5], hitting times [3, 9], and diffusivity [7] appear 
in the physics literature, but few rigorous results are known. In [4], Durrett considers the 
Newman-Watts small world, proving lower and upper bounds on the mixing time of order 
log 2 re and log 3 re, respectively, and suggests that the lower bound should in fact be correct. 
The principle contribution of this paper is to confirm Durrett's prediction. (Theorem 1, below). 



To define the Newman-Watts small world, first fix integers 
n > k > 1. The (n,k)-ring R nt k is the graph with vertex set 
[n] = {l,...,re} and edge set {{i,j} :i+l<j<i+ k}, 
where addition is interpreted modulo n. (A picture of i?is,3 
appears to the right.) In particular, an (n, l)-ring is a cycle 
of length n, and whenever n > 2k the (n, k) ring is regular of 
degree 2k. For < p < 1, the (n,k,p) Newman- Watts small 
world ff n ,fc,p is the random graph obtained from the (n, /c)-ring 
by independently replacing each non-edge of the (re, fc)-ring by 
an edge with probability p. We write ff as shorthand for H n ^ yP 
whenever the parameters are clear from context. 

Given a (finite, simple) graph G = (V,E), by a lazy simple random walk on G we mean 
a random walk that at each step stays still with probability 1/2, and otherwise moves to a 
uniformly random neighbour. In other words, this is a Markov chain with state space V and 
transition probabilities 




Px,: 



\ \ix = y 

to if y G n g(x) 

otherwise 



H ~ " Jtv ■ •-■ „ ., 

scv 2 



where (1q(x) denotes the number of neighbours of x in G and Nq(x) denotes the collection of 
neighbours of x in G. (More generally, we shall say a chain is lazy if p xx > 1/2 for all x in the 
state space.) If G is connected then this Markov chain has a unique stationary distribution tt 
given by tt(x) = dc(x)/2\E\. 

Given two probability distributions [i, v on V, we define the total variation distance between 
fi and v as 

sup \fi(S) - i/(5)| = - \fi{v) - u(v)\, 

where /u(5) = Ylves^i 11 )- Now let (Xk)k>o be a lazy simple random walk on G, and write 
fik t x for the distribution of when the walk is started from x\ formally, for all y G G, 
fJ-k,x(y) = P {,Xt = y\X$ = x). The mixing time of the lazy simple random walk on G is 
defined as 

Tmix(G') = maxminjfc : \\jx kx - 7r|| TV < 1/4}. 

X 

(There are many different notions of mixing time, many of which are known to be equivalent 
up to constant factors - the book [11], and the survey [12] are both excellent references.) We 
may now formally state our main result. 

Theorem 1. Fix c > 0, let p = c/n, and let H be an (n,k,p) Newman-Watts small world. 
Then there is Cq > depending only on c and k such that with probability at least 1 — 0(n~ 3 ), 

Cq" 1 log 2 n < r mx (H) < C log 2 n. 

Furthermore, E[t mix (if)] <Q)(log 2 n + l). 
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The expectation bound in Theorem 1 follows easily from the probability bound. Indeed, 
given a finite reversible lazy chain X = (Xt,t > 0) with state space Q, for x G f2 write 
t x = min{i > : Xt = x} for the hitting time of state x. Then r MIX < 2 max^gQ K n (r x ) + 1, 
where denotes expectation starting from stationarity (see, e.g., [11], Theorem 10.14 (ii)). 
If X is a lazy simple random walk on a connected graph G = (V,E) with |V| = n, then 
maxjgv' ^(ti) < (4/27 + o(l))n 3 (see [2]). Assuming the probability bound of Theorem 1 and 
applying the two preceding facts we obtain that 

E[r MIX (tf )] < C log 2 n + ^n 3 (l + o(l)) • P (r MIX (iT) > C log 2 n) < C (log 2 n + 1), 

assuming Co is chosen large enough. 

The lower bound of Theorem 1 is also straightforward, and we now provide its proof. For 
n sufficiently large, given v G [n] and £ G N the probability that all vertices w of H n i cp with 
w G [v — £, v + £] mod n have degree exactly 2k, is greater than 

(l_ p )"(2*+l) > e -2 C (2£+l) ) 

the inequality holding since (1 — c/n) n > e _2c for n large. Taking a = l/(8c), it follows 
easily that with probability at least 1 — 0(n -3 ) there is v G [n] such that all vertices w with 
w G [v — alogn, v + alogn] mod n have degree exactly 2k. Furthermore, the random walker 
started from such a vertex v will with high probability take time of order log 2 n before first 
visiting a vertex in the complement of [v — a log n,v + a log n] mod n. Finally, under 7r, the set 
[v — a log n, v + a log n] mod n has measure tending to zero with n, and it thus follows from 
the definition of r MIX that the mixing time is of order at least log 2 n whenever such a vertex v 
exists. 

Having taken care of the expectation upper bound and lower bound in probability from 
Theorem 1, the remainder of the paper is now devoted to proving that with probability at 
least 1 — 0(n -3 ), t mix (H) = 0(log 2 n). In Section 2 we explain a conductance-based mixing 
time bound of Fountoulakis and Reed [6] which will form the basis of our approach. The 
Fountoulakis-Reed bound requires control on the edge expansion of connected subgraphs of 
Hn,k,p- To this end, in Section 3 we bound the expected number of connected subgraphs of 
Hn,k,p of size j, for each 1 < j < n; our probability bounds on the edge expansion of such 
subgraphs follow in Section 4. Finally, in Section 5 we finish the proof of Theorem 1. The 
proof is more straightforward when c is large; in order to get the key ideas across cleanly we 
accordingly handle the large-c and small-c cases separately. 

1.1. Notation. Given a graph G, write V(G) for the set of vertices of G, and E{G) for the 
set of edges of G. Also, given S C V(G), write G[S] for the subgraph of G induced by S. We 
say S is connected if G[S] is connected. Finally, given a formal power series F(z), we write 
[z^]F(z) to mean the coefficient of z J in F(z), so if F(z) = ^fc>o afc2;fc t nen [^^(z) = a,j. 

2. Mixing time via conductance bounds 

A range of techniques are known for bounding mixing times ([13] is a recent survey of 
the available approaches), many of which are tailor-made to give sharp bounds for particular 
families of chains. One particularly fruitful family of techniques is based on bounding the 
conductance of the underlying graph, a function which encodes the presence of bottlenecks at 
all scales. The precise bound we shall use is due to Fountoulakis and Reed [6]. Given sets 
S, T C V, write E(S,T) = Eq(S,T) for the set of edges of G with one endpoint in S and the 
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other in T, and write e(S,T) = \E(S,T)\. Also, given S C V write e(S) = YlveS ^g{ v )- The 
conductance of 5, written $(5), is given by 

e{S, S c ) 



e(S) 



For < x < 1/2, write 



= min ®(S); 

S connected 
x\E\<e(S)<2x\E\ 



one can think of <&(x) the (worst case) connected conductance of G at scale x. We will use the 
following theorem, a specialization of the main result from [6]. 

Theorem 2 ([6]). There exists a universal constant C > so that for any connected graph G, 

r m x(G)<C ^ 2 (2"')- 

i=l 

With this theorem at hand, proving mixing time bounds boils down to understanding what 
sorts of bottlenecks can exist in G. For the (n, k,p) Newman- Watts small world with p = c/n, 
it is not hard to see that small sets can have poor conductance. Indeed, in the introduction we 
observed that with high probability the ring R Ut k will contain connected sets S with ©(log re) 
nodes, to which no edges are added in H n ^^. Such a set S will have e(S, S c ) < k 2 and so 
will have conductance $(S) = 0(l/logn). It follows that in this case the best possible mixing 
time bound one can hope to prove using Theorem 2 is of order log 2 n. 

We will prove the upper bound in the probability bound of Theorem 1 by showing that 
there are constants e > 0, Co > such that with high probability, whenever | *S' | > Co log re, 
we have <&(S) > e. To accomplish this using Theorem 2, we will need control on the likely 
number of connected subgraphs of H n ^, p of size s, for all s > log re. In the next section, we 
bound the expected number of such subgraphs using Lagrange inversion and comparison with 
a branching process. 



3. Counting connected subgraphs 



Fix c > and k > 1, let p = c/n, and let H = H n ^ tP be a Newman- Watts small world. Let 
v E [n], write Bj )V = Bj )V (H) for the set of all S C [n] containing v with \S\ = j such that 
H[S] is connected, and let Bj = \J Bj jV . Our aim in this section is to establish the following 

v e[n] 

proposition. 

Proposition 3. For any positive integer j and any v £ [re], E|i?j it) | < (4(c + 2k)) 3 , and 
E\Bj\ < n(4(c + 2k))i . 

We will prove Proposition 3 by comparison with a Galton-Watson process. Recall that a 
Galton-Watson process can be described as follows. An initial individual - the progenitor - 
has a random number Z\ of children, where Z\ is some non-negative, integer-valued random 
variable. The distribution of Z\ is called the offspring distribution. Each child of the progenitor 
reproduces independently according to the offspring distribution, and this process continues 
recursively. The family tree of a Galton-Watson process is called a Galton-Watson tree, and 
is rooted at the progenitor. 

The number of neighbours of a vertex in H n ^ )P is distributed as Bin(n — 2k — + 2k. 
From this it is easily seen that \Bj V \ is stochastically dominated by the number of subtrees of 
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size j containing the root in a Galton-Watson tree with offspring distribution Bin(n — 2k — 
l,p) + 2k. To bound the expectations of the latter random variables, we will first encode 
these expectations as the coefficients of a generating function, then use the Lagrange inversion 
formula ([19], Theorem 5.4.2), which we now recall. This approach was suggested to us by 
Omer Angel in a mathoverflow comment (http://mathoverflow.net/questions/66595/); 
we thank him for the suggestion. 

Theorem 4 (Lagrange inversion formula). If G(x) is a formal power series and f(x) = 
xG(f(x)), then 

n[x n ]f{x) k = k[x n - k )G{x) n . 

Fix a non-negative, integer- valued random variable B, and for m > write p m = P (B = m). 
Given a Galton- Watson tree T with offspring distribution B, let fij = fij(B) denote the 
expected number of subtrees of T containing the root of T and having exactly j vertices (so 
Ho = 0). Also, write 

Qj = Y Pm ( m ">ii 

m>j 

where (m)j = m\/(m — j)l is the falling factorial. Note that qj is the expected number of 
ways to choose and order precisely j children of the root in T. Let F(z) = Yl^jLo^j^ an d 
Q(z) = X^j=o Qj zJ b e t ne generating functions of fj,j and qj respectively, viewed as formal power 
series. 

Lemma 5. F(z) = zQ(F(z)) 



Proof. We have 



Q(F(z))=J2Qj E 



j>0 
j>0 



, r>l 



r>j r i+- 



i 

) 



r>0 j<r 



( 



7'lH \-rj—r 

\n,— ,r £N+ 



(1) 



/ 



The r'th term in the outer sum in (1) encodes subtrees of T with r + 1 vertices that contain 
the root, as follows. First specify the degree j of the root of the tree T to be embedded. Then 
choose which j children of the root of T will form part of the embedding, and the order in 
which the children of the root of T will be mapped to these nodes (there are qj ways to do this 
on average). Next, choose the sizes ri, . . . , r,- of the subtrees of the children of the root in the 
embedded tree; finally, embed each such subtree in the respective subtree of T; on average, 
there are n n ways to do this. It follows that 

(1)=!E /+ Vhi (2) 

r>0 



1 
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which proves the lemma. (We remark that verifying that (1) and (2) are equal can be done 
purely formally; however, we find the preceding explanation more instructive.) □ 

Lemma 6. Fix C > 0. if qj < C j for all j > then ft < jC^tiJC '" 1 < (4Cy _1 for all 

;>i- 



Proof. By Lemma 5 and Theorem 4, we have j[z J ]F(z) k = k[z J k ]Q(z) J . In particular, taking 
k = 1, we have ft = [z j ]F(z) = i^'-^Q^'. Now, 



\ 



r>0 



E «1 



z . 



J 



Therefore [z 3 ' l \Q{z) 3 = QhQh " ' %■ Each summand is at most C 3 1 by 

assumption. There are ( 2 jZi) nonnegative integer solutions to the equation l\ H = j — 1, 

so we obtain that [z 3 ' 1 }^)^ < (^jf) • C" _1 . The result follows. □ 



The next lemma controls the growth of qj for some important special offspring distributions, 
which allows us to use Lemma 6 to prove Proposition 3. 

Lemma 7. If B is Poisson(c) distributed then qj = c 3 for all j. Also, if B is Bin(n, c/n) 
distributed, then for all j > 0, qj < c J . Finally, if B — £ is Bin(ra, c/n) distributed for some 
fixed i > 0, then for all j > 0, qj < (c + £) 3 . 



Proof. If B is Poisson with mean c, then 

m>j v J> v 7 

If B = Bin(n, c/n) then 




=(^-»(n-i)-(»- J+ i)i: (;::)(^r( i -r m 

j<m<n x ' 

Finally, if B = Bin(n, c/n) + I then we consider three Galton-Watson trees 71,72,73 with 
offspring distributions B\ = l,B2 = Bin(n, c/n) and B% = Bin(n, c/n) +£ respectively. For 
i = 1,2,3 write = qj(Bi). Since q~p is the expected number of ways to choose and order 
precisely j children of the root in 71, we have q~p = (l)j < V and by the previous argument 

(2) ' (3) 

we know that q - < c 3 . Finally, since q - is the expected number of ways to choose and order 
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precisely j children of the root in 7%, by independence we have 

3 



o— n \ / 



s=0 

= (i + c y. 

The factor in the first equation is because as long as we choose s positions for children 
coming from the deterministic component of offspring distribution, the order of all j children 
are fixed since the order among s children and the order among the other j — s are both 
fixed. □ 

We remark that if T has deterministic d-ary branching (every node has exactly d chil- 
dren with probability one), then for all j, the number of subtrees containing the root and 
having precisely j nodes is exactly (Thm 5.3.10 in [19]) which is bounded above by 

jz^j /j < (eciy . Thus, Lemma 6 shows that when factorial moments grow only expo- 
nentially quickly, the values fij behave roughly as in the case of deterministic branching. We 
also note that when T has Poisson(c) branching distribution, Lemma 6 and the argument of 
Lemma 7 together yield the exact formula [ij = ( 2 jZ±) ■ 

Proof of Proposition 3. For any v E V{Hn,kj>)> the random variable \Bj V \ is stochastically 

dominated by fij(B), where B = Bin(n, c/n) + 2k. By Lemma 7, we have that qj(B) < (c+2ky 
for all j. It then follows from Lemma 6 that qj < (4(c + 2k))i~ 1 for all j, proving the 
proposition. □ 



4. Bounding the expansion of connected subgraphs of H n ^ :P 

Recall from the preceding section that Bj is the collection of connected subsets S of V{H n ^, p ) 
with \S\ = j. We will show that with high probability, for all j > logn, all elements S of Bj 
have conductance uniformly bounded away from zero. Many of our proofs are easier when c is 
large, and we treat this case first. 

In the course of the proofs we will make regular use of the standard Chernoff bounds (see, 
e.g., [8] Theorem 2.1) which we summarize here: 

Theorem 8. If X = Bin(m, q) then 

for all < x < 1, P (X < (1 — x)mq) < exjp(—mq(j)(—x)) < exp(— mqx 2 /2), 
and for all x > 0, P (X > (1 + x)mq) < exp(—rnq(j)(x)) < exp(— mqx 2 /2(1 + x)), 
where (j)(x) = (1 + x) log(l + x) — x. 

We will use the coarser bounds most of the time. The finer bound will only be used twice, 
once in the proof of Lemma 13 and once in the proof of Theorem 1. 

We will also have use of the FKG inequality ([8], Theorem 2.12), which we now recall. Let 
r = [n] = {1,2,- •• ,n}. Given < pi,-- - ,p n < 1, r pi) ... )Pn C [re] is obtained by including 
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element i with probability p, independently for all i. We say a function / : 2 r — > IR is increasing 
if /(A) < f(B) for Ac B, f decreasing if /(A) > /(£) for icB. 

Theorem 9 (FKG inequality) . // the random variables X% and X 2 are two increasing or two 
decreasing functions ofT pit ... tPn , then 



In particular, if Qi and Q 2 are two increasing or two decreasing families of subsets ofT, then 



P (Tpx.... , Pn e Qi n Q 2 ) > P (r pi ,.. , Pn e &) P (r pi ,.. iPn e Q 2 ) • 



We will in fact use the following, easy consequence of the FKG inequality. 
Corollary 10. If C is an increasing event, A is a decreasing event, then 



We begin by bounding the edge expansion of all but the very large connected sets, in the 
case that c is large. 

Lemma 11. Fix c sufficiently large that c/720 — log(4(c + 2k)) > 5. Then for all n, 



Proof. Fix j £ [logn, 9n/10], and S C [n] with | *S' | = j. Note that E(S,S C ) is independent of 
H[S], so given that S S Bj, e(S, S c ) stochastically dominates a Bin(j(n— j),p) random variable. 
Since j(n — j) > jn/W, it follows that under this conditioning e(S,S c ) also stochastically 
dominates X, a Bin(nj'/10,p) random variable. By a union bound it follows that 



P (35, \S\ = j, H[S] connected, e{S, S c ) < cj/ 12) 

< P ( e ( 5 ' ^ °i/ 12 I H l S } connected) • P (H[S] connected) 
S,\8\=j 

< P{X < cj/ 12) ■ E|Bj| 

< e -ci/720 . n (4( c + 2 jfc))i J 



the last line by a Chernoff bound and by Proposition 3. (This is a typical example of our use 
of Proposition 3 in the remainder of the paper.) 

By our assumption that c/720 — log(4(c + 2k)) > 5 and since j > logn, we obtain that this 
probability is at most 



The next lemma provides a lower bound on the edge expansion of very large sets, again in 
the case that c is sufficiently large. 

Lemma 12. If c > 40 then for all n sufficiently large, 



E{XtX 2 ) > E(X 1 )E{X 2 ). 



P(C I A) < P(C). 





The result follows by a union bound over j £ [logn, 9n/10]. 



□ 



P(3S C [n] : |5| > 9n/10,e 



(S) < \E(H)\) < (2/e) 
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Proof. In this proof write E = E(H). Since, for any set S C [n], e(S) + e(S c ) = 2\E\, it suffices 
to prove that 

P (35 C [n] : \S\ < n/10, e(S) > \E\) < (2/e) n . 

Fix any set S with \S\ < n/10. Write e*(S) = Y^ves \{e 3 v : e ^ E{R njk )}\ for the total 
degree incident to S not including edges of the ring R U: k, and similarly let E* = E \ E{R n j t ). 
Since \S\ < n/10 < n/2, in order to have e(S) > \E\ we must in fact have e*(S) > \E*\. Also, 

e*(S) is stochastically dominated by Bin(n 2 /10,p), and \E*\ = Bin(n(n — 1 — 2k)/2,p). When 
n is large enough that n — 1 — 2k > 4n/5, we thus have 

P (e(S) > \E(H)\) < P (\E*\ < cn/5) + P (e*(S) > cn/5) 

< P {\E*\ < c(n - 1 - 2k)/ 4) + P (e*(S) > cn/5) 

< exp (-c(n - 1 - 2k) /16) + exp (-cn/40) , 

by a Chernoff bound. For n sufficiently large the last line is at most 2e _cn//4 ° < 2e~ n , and 
the result follows by a union bound over all S with \S\ < n/10 (there are less than 2 n_1 such 
sets). □ 

A similar but slightly more involved argument yields the following result, which will be 
useful for dealing with smaller values of c. 

Lemma 13. For any c > there is j3 = /3(c) > such that for all n sufficiently large 
P (3S C [n] : \S\ > (1 - (3)n, e(S) < \E(H)\) < (1 - (3) n . 

Proof. As in the proof of Lemma 12, it suffices to prove that for some (3 > 0, 

P (3S C [n] : \S\ < /3n, e(S) > \E\) < (1 - /3) n . 

Furthermore, since P (35 C [n] : IS"! < (3n,e(S) > \E\) decreases as /3 decreases, it suffices to 
find /3 > and e > such that for n sufficiently large, 

P (3S C [n] : \S\ < fin, e{S) > \E\) = 0(e~ en ). 

We fix < (3 < l/(3e) small enough that 1/(2/3) - 8k/c > 1 + 1/(3/3). Additionally, recalling 
the function (p(x) = (1 + x)log(l + x) — x from Theorem 8, we choose f3 small enough that 
0(l/(3j9)) > log(l/(3/3))/(6/3). Finally, we assume /3 < c/36. 

For any S C [n] with jSj < /3n and with e(S) > \E\, defining e*(S) and E* as in the proof 
of Lemma 12, we then have e*(S) > \E\ - 2k\S\ > \E\ - 2k(3n. 

P(3SC [n] : \S\ < pn,e(S) > \E\) 

< P (3S C [n] : \S\ < fin, e*(S) > \E\ - 2k(3n) 

< P < {^\p-2k^n\ +P (3S C [n] : |5| < /3n,e*(S) > Qp-^nJ . (3) 
Since \E\ stochastically dominates Bin((/^),p), by a Chernoff bound we have 

P I \E\ < I ™ ]p- 2A;/3n ] < exp ( -\\ n )p ' 2? ' 



2 J 1 - -"""") 2\2j*\(»)p 

< exp ( -— • n j , (4) 

which handles the first summand in (3). For the second summand, let X be Binomial(/3n(n — 
l),p) distributed, and note that for all S C [n] with IS"! < /3n, e*(S) is stochastically dominated 
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by X. Also, for j3 < 1/3 there are less than 2(^j) subsets of [n] of size less than fin, and 
follows by a union bound that 



p(3Sc[n]:\S\<f3n,e*(S)> Q 



Since l/(p(n — 1)) = n/{c{n — 1)) < 2/c for all n > 2, and by our assumption that 1/(2/3) 
c> 1 + 1/(3/3), we have 



By the sharper of the Chernoff upper bounds in Theorem 8 and by our assumption that 
0(1/(3/3)) > log(l/(3/3))/(6/3), it follows that 

P ( X > (2 V ~ 4A:/3n ) - 6XP ( _/3n ( n ~ 1 ) p ' ^(V(3^))) 



< exp 



log(l/(3/3)) ^ 
-/3n(n - l)p — I 



clog(l/(3/3)) 
6XP ' 12 n 



for n sufficiently large. 

Combined with (5) this yields 

P {^S C [n] : \S\ < fin, e*(S) > (^)p~ 

<< L ;„ J )-(- £M F 2 " 

/e\^ / clog(l/(3/3)) 

^ 2 {p) eXP v 12 U 

= 2 exp (n (/3 + /3 log(l//3) - (c/12) log(l/(3/3)))) 
< 2exp(-(c/36)n) , 

where in the last inequality we used that /3 + /31og(l//3) < 2/31og(l//3) < (c/18) log(l//3) and 
that log( 1/(3/3)) > 1. Together with (3) and (4) we obtain 

P{3S C [n] : \S\ < 0n,e(S) > \E\) < exp I ^— ■ n \ + 2 exp (-(c/36)n) , 

which completes the proof. □ 



In order to use Lemma 11 to bound the conductance of connected subsets S of V(H n k p ) of 
size at most 9n/10, we need to know that for such subsets we have e(S) = 0(15*1) with high 
probability. Such a bound is provided by the following lemma. 

For given k, let x = Xk be the positive solution of equation x/720 — log(4(x + 2k)) = 5, and 
let M = M(c, k) = k + 1 + 10max(xfc, c). We remark that xt > 40 for all k>l. 

Lemma 14. For all c > and for all n, 

PI35G \J Bj,e(S,S) > M-max(|5|,logn) I < ^ 

\ l<i<" / " 
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Proof. First note that for fixed M, the event whose probability we aim to bound is increasing, 
so increasing p only increases its probability of occurrence. Since M(c, k) is constant for c < x^, 
it thus suffices to prove the bound for c > Xk, and we now assume that c > Xk- Note that in 
this case c/720 — log(4(c + 2k)) > 5. For all n, and any j G [re], we have 

P (3S G Bj, e(S, S) > (£; + 1 + 10c) max(j, logn)) 

< P (e(S, S) > (k + I + 10c) max(j, logn), S £ Bj) . (6) 

ScH,|S|=j 

Write Ts for the set of all possible trees on vertex set S - so |T S | = ISpl" 2 - and list the 
elements of Ts as t\, . . . , t r . For i G [r], let Fi be the event that U is a subgraph of H, and let 
Ei = Fi \ U,<i F] be the event that ti is a subgraph of H but none of tx, . . . ,t%-\ are subgraphs 
of H. The events Ei partition the event that S G Bj, so 

P (e(5, S) > (k + 1 + 10c) max(j, log re) | S £ Bj ) 
< maxP(e(S',5) > (fe + 1 + 10c) max(j, log n) \ Ei) (7) 

i£[r] 

For fixed i G [r], write Pj (•) for the conditional probability measure P (• | Fi), and write 
rW = {nu : re, u G 5} \ E(tj). Under P i; the set ^(S 1 , 5) \ E(U) is distributed as a Binomial(p) 
random subset of since, after conditioning that ti is a subgraph of H, those edges not in 
U still appear independently. 

Write C for the event that \E(S, S) \ E(U)\ > |(fc + 1 + 10c) max(j, logre)| - (j - 1). Then 



P (e(5, S) > (Jfe + 1 + 10c) max(i, log re) | = Pj C 



Since C is increasing and Hj<i -^7 ^ s decreasing, it follows from Corollary 10 that 

(8) < P* (C) = P S) \ E{U)\ >(k + l + 10c) max(j, logn) - (j - 1)) , 

the last equality holding since E(S, S)\E(ti) is disjoint from E(ti), so independent of Fj. Now, 
S) \ E(ti)\ is stochastically dominated by kj + Bin(max(j, log re) •n/2,p). Letting X have 
distribution Bin(nmax(j, logre)/2,p), respectively, for all i G [r] we thus have 

P (e(5, 5) > (jfe + 1 + 10c) max(j, logn) | Ei) 

< P (kj + (j - 1) + X > (k + 1 + 10c) max(j, log n)) 

< P (X > 10c max(j, logn)) 

< e -9cmax(j,logn)/2 

the last inequality holding by a Chernoff bound. It then follows from (6) and (7) that 

P (3S G Bj, e(S, S) > (k + 1 + 10c) max(j, logn)) 

< V e -9 C max(j,logn)/2p^ g ^ 



Sc[n],\S\=j 
g— 9cmax(j',l( 

< e -9cmax(i,logn)/2 n ^( c + 2 ^y 



-9cmax(j,logn)/2-g|^.| 
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the last inequality by Proposition 3. By assumption, c is large enough that c/720 — log(4(c + 
2k)) > 5, and it follows that 

P (3S e Bj, e(S, S) > (k + 1 + 10c) max(j, logn)) 

<^ glog n+log(4(c+2fc)) max(j',logn)— 9cmax(j',logn)/2 

< n~ 4 . 

A union bound over j £ [n] completes the proof. □ 

5. Proof of Theorem 1 

As noted in the introduction, the case when c is large is more straightforward, and we handle 
it first. 

Proof of Theorem 1 assuming c > Xk ■ For x > write 

<S> (x) = mm\ y ' ; : S connected, 151 < — , xn(c/2 + k)/2 < e(S) < 4xn(c/2 + k) 
I e(S) 10 

Let A be the event that for all S with |5| > 9n/10 we have e(S) > \E{H)\. If A occurs then 
for all < x < 1/2, for all S with e(S) < 2x\E(H)\ we have e(S) < \E(H)\ so \S\ < 9n/10. 
Also, let A' be the event that n(c/2 + k)/2 < \E(H)\ < 2n(c/2 + k). If A' occurs then for all 
x > all S with x\E(H)\ < e(S) < 2x\E(H)\, we have xn(c/2 + k)/2 < e(S) < 4xn(c/2 + k). 
It follows that on A n A', for all < x < 1/2 we have ®o{x) < 

We have \E(H)\ = nk + Bin(n(n — 2k — l)/2,p), so by a Chernoff bound, for all n large 
enough, P (A 1 ) > 1 — n -3 . Also, by Lemma 12, P (^4) > 1 — n -3 for all n sufficiently large. It 
follows that with probability at least 1 — 2n~ 3 , for all < x < 1/2 we have &o(x) < <&(x), and 
thus with probability at least 1 — 2n -3 , 

Rog 2 \e\] ri°g 2 \m 

^ $- 2 (2-)< Yl $ o 2 (2- i )- 

i=l i=l 

We thus focus on bounding the latter quantity. Note that the sets considered when bounding 
$(2~ l ) decrease (in terms of e(S)) as i increases. Also, recall the definitions of Xk and M = 
M(c, k) from just before the statement of Lemma 14. 

First suppose i > |log 2 (ra(c/2 + k)/(8M logn))J and write 

i = Uog 2 (n(c/2 + fc)/(8M bgn))J + j. 

For all S considered when bounding <3?o(2~ 4 ) we have e(S) < 2 _:,+6 Mlogn and e(S,S c ) > 2, 
so 

riog 2 \E\) - Llog 2 (n(c/2+fc)/(8M log n))J 

$ -2^ 2 -Llog 2 (n(c/2+fc)/(8Aflogn))J-j- ) 

°° 4 6 M 2 
< 4 5 M 2 log 2 n V 4" J = log 2 n. 

3=0 3 

Next suppose that i < log 2 (n(c/2 + k) /(8M logn)). In this case we have e(S) > 4Mlogn. 
By Lemma 14, with probability at least 1 — n -3 , for all connected sets S with \S\ < logn we 
have e(S,S) < Mlogn. Since e(5) > 4Mlogn this implies that e(S,S c ) = e(S) - 2e(S,S) > 
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e(S) - 2Mlogn > e(S)/2, so $(5) > 1/2. Also, by Lemmas 11 and 14, with probability at 
least 1 — 2n~ 3 , for all connected sets S with |5| > logn, we have 

e(S, S c ) = e(S,S c ) c\S\ 1 c 

e(5) e(S,S c ) + 2e(S,S) ~ 12 c|5|/12 + 2M\S\ ~ 36M' 

so $(5) > c/(36M). Since 1/2 > c/(36M) it follows that with probability at least 1 - 3n" 3 , 
for all i < log 2 (n(c/2 + k)/(8Mlogn)) we have <&o(2~') > 3^7, and in this case 

Llog 2 (n( C /2+fe)/(8A/logn))J 4 - 

£ $ - 2 (2^)<^log 2 n. 

i=l 

Combining these bounds, we see that with probability at least 1 — 3n -3 , 

2, n 4 6 M 2 2 6 4 M 2 
^ *„ ( 2 ) < log 2 n + — ^- log 2 n, 

i=i d c 

so with probability at least 1 — 5n~ 3 , X^[=i 2 '" B '~' & 2 {2~ l ) is at most the same quantity. By 
Theorem 2 it follows that with probability at least 1 — 5n~ 3 , 

4 6 M 2 log 2 n 6 4 M 2 

— 1 

3 c 2 

This completes the proof in the case c > x^. □ 



Tmix(G9 < C ( 1 log 2 n 



For the remainder of the paper, fix < c < Xf., and let R = [max(fe, 2xi/c)] . Also, recall the 
constant ft = j3{c) from Lemma 13. The remaining case of Theorem 1 follows straightforwardly 
from the following lemma. 

Lemma 15. There is a = a(c) > such that for all n sufficiently large, 

3R 3 



n 3 



P 35 € |J Bj(H),e H (S, S c ) <a\S\]< 

\ Rlogn<j<(l-/3)n 

We provide the proof of Lemma 15 at the end of the paper. 

Proof of Theorem 1 assuming c < x^ . For x > write 
f e(S S c ) 

$ (x) = min <^ v ' . ' : S connected, 151 < (1 - P)n, xn(c/2 + k)/2< e(S) < 4xn(c/2 + k) 
I e(5) 

As in the case c > Xk, by a Chernoff bound and by Lemma 13, for all n sufficiently large, with 
probability at least 1 — 2n -3 we have 

rio g2 \e\] ri°g 2 i^n 

<J>" 2 (2- i )< £ $ 2 (2^), 
i=l i=l 

and the remainder of the proof is just as in the case c > x^, but using Lemma 15 in place of 
Lemma 11. □ 



It thus remains to prove Lemma 15; before doing so, we briefly describe our approach. We 
shall divide vertices of H into groups of size R, each containing R consecutive vertices. We 
view each group as a new single vertex; two new vertices are connected if there is an edge 
connecting their constituent sets. This yields an auxiliary graph H', whose distribution is that 
of an {n' , l,p') Newman- Watts small world, for suitable n' and p' . We will shortly see that 
p' = d /n' for some d > x±, so all "large-c" results can be applied to H' . 
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To translate edge expansion results from H' into corresponding results for H, we proceed 
as follows. Given a set S of vertices of H , we consider the blow-up S + of S, which is the 
collection of all vertices of H belonging to the same group as some element of S. The idea is 
that in most cases, the event that e(S,S c ) is small relative to \S\ should be nearly identical 
to the event that e(S + , (S + ) c ) is small relative to \S + \. If this were always true, Lemma 11 
would then yield bounds for the number of edges leaving S + , which would in turn yield strong 
bounds on the probability that e(S + , (S + ) c ) < e \S + \, where e > will be a function of R. 

The above line of argument relies upon the intuition that the size of the blow-up S + should 
be essentially a constant factor greater than that of S. Since the ratio |5 + |/|5| is in fact a 
random quantity, to make the above argument work, we end up needing to additionally show 
that e(S + , (S + ) c ) is very unlikely to be large if e(S, S C )/\S\ is extremely small. In order to 
quantify the notion of "extremely small", we are forced to introduce a third parameter 5 > 
with 5 much smaller than e. We now turn to the details. 

Proof of Lemma 15. We assume for simplicity that R divides n - the general case is practically 
identical - and write n' = n/R. For i G [n'\ let W{ = {(i — 1)R + j, 1 < j < R}. We form 
an auxiliary graph H' = (V , E') with V = {w{,i G [n 1 ]} by adding an edge between u>i 
and Wj if there is some edge from an element of Wi to an element of Wj in H. It is easily 
verified (using the fact that R > k) that H' is an (n' , l,p') Newman- Watts small world, with 
p' = P (Bin(R? ,p) > 0) > Rc/2n' = c'/n' (where d = Rc/2) for all n sufficiently large. Note 
that since d = Rc/2 > x\, it follows that we may apply Lemma 11 to H' - this is the only 
way we will use this bound on R. 

Given S C [n], write I' = {i G n' : WiCiS ^ 0}, let S' = {wi, i G I'}, and write S + = (JieS 7 Wi - 
Now write j = \S\. As in the proof of Lemma 14, we partition the event that S G Bj(H) into 
E\(S), . . . , E r (S) according to the first spanning tree appearing in S. Fix e = c/ (12R(2Rc+ 1)) 
and let S > be small enough that ec > 2k5 and that eRc (log (| ) — l) > 5 + log(4(c + 2k)). 
For all 1 < i < r we then have 

P {e H {S + \ S, (S + ) c ) > 2eRcj \ e H (S,S c ) < 6j,Ei(S)) 

< P (e H (S + \ S, (S + ) c ) > (ec + 2k5)Rj \ e H (S, S c ) < 6j, E^S)) 

< P [e H (S + \ S, (S + ) c ) > (ec + 2k5)Rj \ e H (S, S c ) < 6j, U C H) . 

The first inequality is true since we pick 5 such that ec > 2k5 and the second inequality holds 
by Corollary 10. Now write g(S) = \{i G S' : \S H Wi\ < \wi\}\, so g(S) is the number of sets 
Wi that intersect S but are not covered by S. It is easily checked that e#(5, S c ) > g(S); the 
extremal case is that for each i G S' , en(S D Wi,Wj \ S) = 1, while for i, j G S' with i ^ j, 
e H (wi,Wj) = 0. 

Next, suppose that S C [n] satisfies en(S,S c ) < 5j. Then we must have g(S) < Sj, and 
it follows that \S + \ S\ < R5j. Under this conditioning, en(S + \ S, (S + ) c ) is stochastically 
dominated by 2kR5j + Bm(R5jn,p), and is independent of the event that tj C H since they 
are determined by disjoint sets of edges, so by the finer of the Chernoff upper bounds, 

P (e H (S + \ S, (S + ) c ) > (ec + 2k5)Rj \ e H (S, S c ) < Sj, U C H) < exp {-eRcj (log - l)) . 

It follows that for j > R log n, writing 

5i = {S C [n],S G B J (H),e H (S,S c ) < 5j,e H (S + \S, (S + ) c ) > 2eRcj}, 



THE MIXING TIME OF THE NEWMAN-WATTS SMALL WORLD 



15 



we have 



E l 5 i|< E E V{Ei(S),e H (S,S c ) <5\S\,e H (S + \S,(S + ) c ) > 2eRc\S\) 

Sc[n],\S\=j l<i<r 

< exp (-eRcj (log (^j - l)) E|{5 C [n],\S\ = j,H[S\ connected}] 

< exp (-eRcj (log (|) - l)) • n ■ (4(c + 2k)) j 
<n" 4 

the second-to-last inequality by Proposition 3, and the last since we chose 5 such that 

eRc (log (|) - l) > 5 + log(4(c + 2k)). 

and since j > Rlogn > logn. It follows by a union bound over R log n < j < 9n/10 and 
Markov's inequality that 



P[35G (J Bj(H),e H {S,S c ) < 5\S\,e H (S + \ S,(S + ) C ) > 2eRc\S\ j < ^ 

\ Rlogn<j<9n/10 / 



(9) 



Next, write 
S 2 = 



Se [j B 1 (H),e H (S,S c )<e\S\,e H (S + \S,(S + ) c )<2eRc\S\ 

Rlogn<j<9n/(10R) 



For any S G 52, we have 

e H (S + , (S+) c ) = e H (S, (S + ) c ) + e H (S + \S, (S + ) c ) 
<e H (S, S c ) + e H (S + \S,(S + ) c ) 

< e(2Rc + l)\S\ 

< e(2Rc + l)R\S'\ 
= c'\S'\/12. 

the last equality by our choice of e. It follows that e#/(«S", (5") c ) < c'\S'\/12. Furthermore, 
since \S\ > iilogn we have |S"| > logn > logn', and since \S\ < 9n/(WR) we have \S'\ < 
9n/(WR) = 9n'/W. It follows by Lemma 11 that 

P [j B j (H),e H (S,S c )<e\S\,e H (S + \S,(S + ) c )<2eRc\S\ 

\ Rlogn<j<9n/(10R) 

< pUs'G (J B 3 (H'),e H ,(S',(Sr)<c'\S'\/l2\ < ^ (10) 

\ logn'<j<9n'/10 / ^ ' 

Next, for any m > 1, if en(S, S c ) < m then, viewed as a subset of a cycle of length n, S 1 must 
have at most m connected components. The number of subsets of an n-cycle with at most m 
connected components is 2n( n ^"^ 1 ). (This is a straightforward combinatorial exercise but 
may be seen as follows: the factor n chooses a starting point on the cycle, the factor ("^m"^ ^ 
chooses the points on the cycle at which membership in S alternates, and the factor 2 accounts 
for whether or not the starting point belongs to S.) 
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It follows that for any 7 > 0, the number of subsets of an n-cycle with at most jn connected 
components is at most 



2n\ , , < zn 

L 2 7 n J / V 2 7 n 



2-yn 



exp ( log(2n) + n ■ 27 ( 1 +log ( 1 + — ) ) ) (11) 



2 7 

Since x(l + log(l/(2x))) — >• as x \. 0, we may choose < 7 < 9/3c/(20i?) small enough that 
(11) is at most exp(n • 9/3c/(160i?)) for all n sufficiently large. 

Finally, for a fixed set S with 9n/(l(XR) < \S\ < (1 - /3)n, the probability that e H (S,S c ) < 
is bounded above by P (Bin(^^ ■ f3n,p) < jn) since en(S,S c ) stochastically dominates 
Bin(\S\{n-\S\),p) and \S\ > n - \S\ > /3n. Since ^ < ^ by a Chernoff bound we have 

P (e H (S, S c ) < 7 n) < P fBm(^ • /3n,p) < 7 n) < exp (-^n) . (12) 

Since all sets S C [n] with at least jn components (still viewed as subsets of the n-cycle) 
have e(S,S c ) > 772, it follows by (11), (12), and a union bound over sets with at most -yn 
components that for all n sufficiently large, 

p( 3S£ U ^.«-W^<7n)< 2 »(-| i I^J)-p(-^-) 

V 9n/(10B)<i<(l-/9)n / V 7 

/ 9/3c 

- exp v "^ n 

<^3- (13) 
Writing a = min(7,e,<5), it follows from (9), (10), and (13) that 

P 35 G J £,(#), e^(S,S c ) <q|5| J < " 

\ ^logn<i<(l-/3)n / 

Since n' = n/R this completes the proof. □ 
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